(57) Abstract:

PROBLEM TO BE SOLVED: To provide an information 
retrieving method which efficiently and accurately 
retrieves information containing an image and to provide 
a record medium where an information retrieving program 
is recorded. 

SOLUTION: An automatic information collecting device 3 
collects data including images on the WWW, a feature 
quantity extracting device 7 extracts feature quantities 
of the gathered data, and a word extracting device 11 
extracts words in the vicinity of image information, 
assigns points depending upon how close to the image 
the respective words are, and stores the data and words 
in a database 9 so that they are associated with the 
data feature quantities and points. When retrieval is 
performed, the feature quantity of a key image is 
extracted to obtain an approximation order from the 
feature quantities of data in the database 9; images 
relating to a key word are ranked according to the 
approximation order between the key image and the images 
in the database, and the words in the database matching 
the key word and their points; and the retrieval result 
is obtained according to that order. 
(54) [Title of the Invention] Information retrieving method and 
a recording medium having such an information retrieving 
program. 

(57) [Abstract] 

[Object] To provide an information retrieving method for 
retrieving information containing images efficiently and 
appropriately and a recording medium having such an information 
retrieving program. 

[Means to solve the problems] A method having the steps of: 
collecting data containing an image on the WWW by using an 
automatic information collecting device 3; extracting the 
feature amount of this collected data by using a feature-amount 
extraction device 7; extracting words in the vicinity of the 
image information by using a word extraction device 11; assigning 
higher points to words located closer to the image, and storing 
them in a data base 9 with the data and words being associated 
with the data feature amount and the 
points; upon retrieval, extracting the feature amount of a key 
image, while finding the order of approximation from the feature 
amount of data within the data base 9; and with respect to the 
key image and a key word, based upon the order of approximation 
between the key image and the image within the data base as well 
as the key word and those words within the data base that are 
coincident with the key word and their points, prioritizing the 
images relating to the key word, thereby obtaining the results 
of the retrieval based upon the order of priority. 

[EMBODIMENTS] 

[0013] FIG. 1 is a block diagram that shows a construction of 
an information retrieving system for executing an information 
retrieving method in accordance with one embodiment of the 
present invention. The information retrieving system shown in 
this Figure is provided with a WWW server 1 connected to the 
Internet so as to carry out normal server functions, and an 
automatic information collecting device 3 that is also 
connected to the Internet, and collects data including images 
on the WWW. The data including images collected by the 
automatic information collecting device 3 is temporarily stored 
in a data base 5. 

[0014] The collected data, stored in the data base 5, is read 
out by a feature amount extraction device 7, which extracts the 
feature amount of the data and stores it in another data base 9. 
The collected data, stored in the data base 5, is also read out 
by a word extraction device 11, which analyzes its HTML so that 
words located in the vicinity of the image information are 
extracted. Based upon the assumption that the closer an 
extracted word is located to the image information, the more 
closely the word is related to the image, higher points are 
assigned to words closer to the image, and each word, with its 
points, is stored in the data base 9 in association with the 
corresponding image information and the feature amount of the 
image. In other words, in the data base 9, with respect to each 
piece of image information, the feature amount of the image, 
each word related to the image, and the points of each word are 
stored in association with each other. 
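
As an illustration of the proximity scoring described in [0014], the following is a minimal Python sketch. The source does not specify how distance is measured or how points are computed, so the token-count distance, the `window` parameter, the linear scoring rule, and the names `score_words_near_images` and `_TokenCollector` are all assumptions made for this example.

```python
from html.parser import HTMLParser
import re


class _TokenCollector(HTMLParser):
    """Flattens HTML into a token list; each <img> tag becomes an ('img', src) token."""

    def __init__(self):
        super().__init__()
        self.tokens = []  # entries are ('word', text) or ('img', src)

    def handle_starttag(self, tag, attrs):
        if tag == "img":
            self.tokens.append(("img", dict(attrs).get("src", "")))

    def handle_data(self, data):
        for word in re.findall(r"\w+", data):
            self.tokens.append(("word", word.lower()))


def score_words_near_images(html, window=5):
    """Map each image src to {word: points}; closer words receive more points.

    Assumed rule (not from the source): distance is the number of tokens
    between the word and the <img> tag, and points = window - distance + 1.
    """
    collector = _TokenCollector()
    collector.feed(html)
    collector.close()
    tokens = collector.tokens

    result = {}
    for i, (kind, src) in enumerate(tokens):
        if kind != "img":
            continue
        points = result.setdefault(src, {})
        for j, (other_kind, word) in enumerate(tokens):
            if other_kind != "word":
                continue
            distance = abs(i - j)
            if distance <= window:
                score = window - distance + 1
                # Keep the best score if a word occurs more than once.
                points[word] = max(points.get(word, 0), score)
    return result
```

Under these assumptions, a word immediately before or after an image tag receives `window` points, and words more than `window` tokens away receive none.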
[0015] The data including the image information stored in the 
data base 9 is used when desired image information is retrieved 
based upon a key image by an image retrieving device 13 and when 
desired text information is retrieved based upon a key word 
by a text retrieving device 15. The image retrieving device 
13 extracts the feature amount of the key image inputted by 
the Internet user through the WWW server 1, and collates this 
extracted feature amount with the feature amounts of the 
images inside the data base 9 so that the order of 
approximation is outputted. Moreover, the text retrieving 
device 15 collates the key word inputted by the Internet user 
through the WWW server 1 with the words inside the data base 9, 
and based upon the matching words and their points, 
prioritizes images related to the key word, and outputs 
information of the order of priority related to the images. 
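
The two rankings described in [0015] can be sketched as follows. This is only an illustration: the source does not state what the feature amount is or how it is collated, so the example assumes fixed-length numeric feature vectors compared by Euclidean distance, and the function names `approximation_order` and `priority_order` are hypothetical.

```python
import math


def approximation_order(key_feature, image_features):
    """Return image ids sorted from most to least similar to the key image.

    Assumption: features are equal-length numeric vectors and similarity
    is Euclidean distance; the source does not specify either.
    """
    return sorted(
        image_features,
        key=lambda image_id: math.dist(key_feature, image_features[image_id]),
    )


def priority_order(key_word, word_points):
    """Return ids of images whose stored words include key_word, highest points first."""
    matches = {
        image_id: points[key_word]
        for image_id, points in word_points.items()
        if key_word in points
    }
    return sorted(matches, key=lambda image_id: matches[image_id], reverse=True)
```

Here `image_features` and `word_points` stand in for the per-image records held in the data base 9: one maps an image id to its feature vector, the other maps an image id to its word-to-points table.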
[0016] The information of the order of approximation of images 
related to the key image outputted from the image retrieving 
device 13 and the information of the order of priority of images 
related to the key word outputted from the text retrieving 
device 15 are supplied to a retrieval result mixing device 17, 
which outputs the results of retrieval based upon the 
information of the order of approximation of images related to 
the key image and the information of the order of priority of 
images related to the key word. The results of retrieval are 
sent to the user from the WWW server 1. 
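
The source does not say how the retrieval result mixing device 17 combines the two orderings. Purely as an illustration, the sketch below assumes a simple rank-sum rule: each image's positions in the two lists are added, and an image missing from a list is treated as ranked just after that list's last entry. The name `mix_results` and the rule itself are assumptions, not the method of the source.

```python
def mix_results(approximation, priority):
    """Merge two ranked lists of image ids by summed rank (lower total ranks first).

    Assumed rule: an image absent from one list is ranked just past that
    list's end. Ties keep the order of first appearance.
    """
    # De-duplicate while preserving first-seen order.
    candidates = list(dict.fromkeys(approximation + priority))

    def total_rank(image_id):
        a = approximation.index(image_id) if image_id in approximation else len(approximation)
        p = priority.index(image_id) if image_id in priority else len(priority)
        return a + p

    return sorted(candidates, key=total_rank)
```

Other combinations, such as weighting one ordering more heavily than the other, would fit the description in [0016] equally well.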

[0017] Next, referring to a system structural drawing in FIG. 
2 and a flow chart in FIG. 3, an explanation will be given of 
an information collecting process carried out in the 
information retrieving system arranged as described above. 
[0018] As illustrated in FIG. 2, the information retrieving 
system of the present embodiment, shown in FIG. 1, is connected 
to a plurality of Web sites 23a, 23b, 23n, through the Internet 
21, so that texts written in HTML and image information, stored 
in their data bases 25a, 25b ... 25n, are collected by the automatic 
information collecting device 3 through the Internet 21, and 
stored in the data base 5 as described above. 
[0019] In this image collecting process, as shown in the flow 
chart of FIG. 3, first, the automatic information collecting 
device 3 of the information retrieving system specifies a 
desired Web site 23 from which information is to be collected, 
collects information from the data base 25 of the specified 
Web site 23, and stores the resulting information in the data 
base 5 (step S11). After such an information collecting process 
has been carried out on all the Web sites (step S13), image 
files are selected from the collected information, the feature 
amount extraction device 7 extracts the feature amount of each 
of these images, and the resulting data is stored in the data 
base 9 (step S15). Then, key words are extracted by analyzing 
the HTML file, points are assigned to each extracted key word 
depending on its distance from the tag specifying the image, 
and the key words and points are stored in the data base 9 
(step S17). 
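
The flow of steps S11 to S17 can be outlined as below. This is a sketch under stated assumptions: in-memory dictionaries stand in for the data bases 5 and 9, and the callables `fetch_site`, `extract_feature`, and `extract_word_points` are hypothetical placeholders for the collecting device 3, the feature amount extraction device 7, and the word extraction device 11. The page layout (`"html"` and `"images"` keys) is also invented for the example.

```python
def collect_and_index(sites, fetch_site, extract_feature, extract_word_points):
    """Run the collection flow and return the index (stand-in for data base 9)."""
    raw_store = {}    # stand-in for data base 5
    index_store = {}  # stand-in for data base 9

    # Steps S11 and S13: collect information from every Web site.
    for site in sites:
        raw_store[site] = fetch_site(site)

    # Step S15: extract and store the feature amount of each image file.
    for page in raw_store.values():
        for image_id, image_data in page["images"].items():
            index_store[image_id] = {
                "feature": extract_feature(image_data),
                "words": {},
            }

    # Step S17: analyze the HTML and store key words with their points.
    for page in raw_store.values():
        for image_id, points in extract_word_points(page["html"]).items():
            if image_id in index_store:
                index_store[image_id]["words"] = points

    return index_store
```

Passing the three devices in as callables keeps the outline independent of any particular crawler, feature extractor, or scoring rule.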